Abstract

In this paper, a unifying framework for orthogonal frequency division multiplexing (OFDM)
multiuser resource allocation is presented. The seemingly isolated problems of maximizing a
weighted sum of rates for a given power budget P and minimizing the sum power for given rate
requirements R can be interpreted jointly in this framework. To this end, we embed the problems
in a higher-dimensional space. Based on these results, we subsequently consider the combined
problem of maximizing a weighted sum of rates under given rate requirements R and a fixed power
budget P. This new problem is challenging, since the additional constraints rule out the
existing approaches. Interestingly, in all problems the optimal decoding order turns out to be
given by the ordering of the Lagrangian factors.

Index Terms 

Broadcast Channel, Multiple-Access Channel, Orthogonal Frequency Division Multiplexing (OFDM), 
Power Control, Water-Filling 



I. Introduction 

Resource allocation has become a crucial challenge in modern digital wireless communication
systems, for the downlink as well as for the uplink. Limited system resources such as
power, bandwidth and transmit time have to be assigned in an optimal way, where optimality
depends very much on the requirements of the subscribers and the operator: the objective is
to maximize a benefit (or minimize a cost) function such as throughput (or sum power) under
certain constraints, which might be power, delay or other Quality of Service (QoS) constraints
imposed by the system and the users' services. For example, a well-known objective function
is the weighted rate sum, constrained by a given power budget. Choosing the weights to be the
buffer sizes is closely related to the stability of the system [3], [4], but in principle the
weights may be chosen in any way. On the other hand, sum power is an important cost function,
and fairness or target rates are common constraints [5]-[8]. These problems seem isolated, and
the connection between them, if any, remains unclear.

The first optimization problem in particular arises when multiuser capacity regions are
characterized: for the flat fading case, the delay-limited and ergodic capacity regions of the
multiple access channel (MAC) have been studied in [9], [10] using the weighted sum-rate
approach. The equivalent broadcast channel (BC) was treated in [11], [12]. A general duality
between MAC and BC for perfect channel state information was established in [13], which allows
results to be translated from one multiuser channel to the other.

Since OFDM has been implemented in promising standards such as digital audio/video
broadcasting (DAB, DVB) and wireless local area networking (WLAN), and since OFDM is
emerging as a key technology for future 4G systems, resource allocation for OFDM has been
investigated intensively [14]-[18]. Using the weighted sum-rate approach, it was shown that
frequency division multiple access (FDMA) is optimal to achieve the maximum sum rate for
a sum power constraint [14] and for individual power constraints [17]. This is not the case
for an arbitrary weighted rate-sum optimization problem, i.e. for other points on the boundary
of the capacity region. The sum power minimization problem was studied in [7], leading to a
nonconvex formulation. The general difficulty in finding the minimum sum power is that the
OFDM channel is a non-degraded channel, which makes it extremely complicated to determine
the optimal decoding order. This difficulty does not affect the maximization of a weighted rate
sum, since there the decoding order can be found by exploiting the polymatroid structure of the
capacity region.

In this paper, we find the minimum sum power of the OFDM multiuser channel for required
rates. We show that a wide class of problems can be formulated in a unifying framework
embedded in a higher-dimensional space based on Lagrangian theory. This establishes a
connection between the optimization problems. Being aware of this connection, we can derive
efficient algorithms for all of these problems. Furthermore, we can combine the two antagonistic
views of minimum rates and weighted rate-sum maximization into a more sophisticated problem that
takes both requirements into account, similar to [19] but for the frequency-selective case. In
addition, the resulting vector of Lagrangian parameters reveals the optimal decoding (encoding)
order.

The remainder of this paper is organized as follows: Section II describes the system model
and Section III states the problem formulation. In Sections IV-VI the solutions to the specific
problems are derived and the connections among them are shown. Finally, we conclude with
Section VIII.

II. System Model 

In this paper, we use the following notation: vectors and matrices are written in boldface and
sets in calligraphic letters. The notation $\mathbf{A} = \mathrm{diag}_{k=1}^{K}\{a_k\}$ denotes
the $K \times K$ diagonal matrix with the entries $A_{k,k} = a_k$ on its main diagonal.
$\mathbb{E}\{\cdot\}$ is the expectation operator and $\|\cdot\|_1$ the $\ell_1$-norm. A
circularly symmetric Gaussian random variable $Z = X + jY \sim \mathcal{CN}(0, \sigma^2)$ is a
random variable with i.i.d. real variables $X, Y \sim \mathcal{N}(0, \sigma^2/2)$, and all
logarithms are to the base $e$.

We assume an OFDM system with $K$ subcarriers, a base station and $M$ users. Further, we
suppose perfect channel state information at both sides. The set of users will be denoted by
$\mathcal{M} = \{1, \dots, M\}$ in the following. The channel between the base station and any
user $m \in \mathcal{M}$ is modeled as a frequency selective channel with $L_m$ taps. Denoting
the $l$th tap of the channel impulse response of user $m$ as $h_m[l]$, the channel on carrier
$k$ can be written as

$$\tilde{h}_{m,k} = \sum_{l=1}^{L_m} h_m[l]\, e^{-2\pi j (l-1)(k-1)/K}. \tag{1}$$

We apply a cyclic prefix (CP) of length $L_{CP} \ge \max_m L_m$ to ensure orthogonality of the
subcarriers and neglect the loss of spectral efficiency due to the CP, assuming that it is small.
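
The mapping from channel taps to per-subcarrier gains can be illustrated with a short sketch. It assumes that the channel on carrier $k$ is the $K$-point discrete Fourier transform of the tap vector and that the power gain is its squared magnitude; the function name `subcarrier_gains` is hypothetical and not part of the paper.

```python
import numpy as np

def subcarrier_gains(taps, num_subcarriers):
    """Return the complex frequency response and the power gain on each subcarrier.

    Assumption: the response on carrier k is the K-point DFT of the impulse
    response, which np.fft.fft computes after zero-padding the taps to length K.
    """
    taps = np.asarray(taps, dtype=complex)
    if len(taps) > num_subcarriers:
        raise ValueError("the number of taps must not exceed the number of subcarriers")
    h_tilde = np.fft.fft(taps, n=num_subcarriers)
    return h_tilde, np.abs(h_tilde) ** 2

# Example: a two-tap channel on K = 4 subcarriers has a spectral null on one carrier.
h_tilde, gains = subcarrier_gains([1.0, 1.0], 4)
```

A single-tap channel yields the same gain on every subcarrier, which corresponds to the flat fading case.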

Now we consider two different systems: the broadcast channel, where the base station sends 
independent messages to each user and the multiple access channel, where each user transmits 
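independent messages to the base station. Assuming channel reciprocity, the system equations
on each subcarrier $k$ can be written as

$$y_{m,k} = \tilde{h}_{m,k} \sum_{n=1}^{M} x_{n,k} + n_{m,k} \tag{2}$$

$$y_{k} = \sum_{m=1}^{M} \tilde{h}_{m,k}\, x_{m,k} + n_{k} \tag{3}$$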
where (2) is the broadcast channel model and (3) the multiple access channel model. In each case,
the signal is scaled by the channel gain and corrupted by circularly symmetric white Gaussian
noise $n \sim \mathcal{CN}(0, \sigma^2)$. Note that the receiver thermal noise power is assumed
to be equal in both cases. If the channel gains $h_{m,k} = |\tilde{h}_{m,k}|^2$ are the same for
the uplink and the downlink, we call MAC and BC dual. We assume perfect channel state
information, that is, the channel is perfectly known at both ends in each fading state. Due to
this assumption, the following is not restricted to our fading model but holds for any known
channel realization.

A. The OFDM broadcast and multiple access channel 

Let $\mathbf{h} = (h_{1,1}, \dots, h_{1,K}, h_{2,1}, \dots, h_{M,K})^T$ denote the stacked vector
of channel gains and $\mathbf{h}_k = (h_{1,k}, \dots, h_{M,k})^T$ the vector of channel gains of
all users on subcarrier $k$. It is known that with Costa precoding the entire capacity region of
a non-degraded broadcast channel, such as the OFDM BC, can be achieved. Note that the OFDM BC
region can also be achieved with superposition coding together with successive interference
cancellation [20], since it can be seen as a set of parallel Gaussian degraded broadcast
channels. Let $\pi \in \Pi$ be an arbitrary Costa encoding order from the set of all possible
encoding orders $\Pi$. Then user $\pi(M)$ is encoded first, followed by user $\pi(M-1)$ and so
on. The base station has full non-causal knowledge of the messages transmitted to the previously
encoded users and can take this knowledge into account when choosing the codeword for the
following user. Denote the power of user $m$ on carrier $k$ by
$p_{m,k} = \mathbb{E}\{|x_{m,k}|^2\}$, the stacked vector of powers as
$\mathbf{p} = (p_{1,1}, \dots, p_{1,K}, p_{2,1}, \dots, p_{M,K})^T$ and the power vector of user
$m$ as $\mathbf{p}_m = (p_{m,1}, \dots, p_{m,K})^T$. Then the rate of user $\pi(m)$ can be
expressed as

$$R_{\pi(m)} = \sum_{k=1}^{K} \log\left(1 + \frac{h_{\pi(m),k}\, p_{\pi(m),k}}{\sigma^2 + h_{\pi(m),k} \sum_{n<m} p_{\pi(n),k}}\right). \tag{4}$$



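The capacity region$^1$ of the OFDM BC under a given sum power constraint $P$ is given by

$$\mathcal{C}_{BC}(\mathbf{h}, P) = \bigcup_{\pi \in \Pi}\ \bigcup_{\|\mathbf{p}\|_1 \le P} \left\{\mathbf{R} = (R_1, \dots, R_M)^T\right\} \tag{5}$$

where $R_{\pi(m)}$ is defined in eq. (4). Using the same stacked vector notation $\mathbf{h}$, the
capacity region of the dual MAC under the same sum power constraint $P$ can be expressed as

$$\mathcal{C}_{MAC}(\mathbf{h}, P) = \bigcup_{\|\mathbf{p}\|_1 = P} \left\{\mathbf{R} : \sum_{m \in S} R_m \le \sum_{k=1}^{K} \log\left(1 + \frac{1}{\sigma^2} \sum_{m \in S} h_{m,k}\, p_{m,k}\right),\ S \subseteq \mathcal{M}\right\}. \tag{6}$$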

Assume ideal successive interference cancellation (SIC) and an arbitrary decoding order
$\pi \in \Pi$. Then user $\pi(1)$ is decoded first, followed by user $\pi(2)$ and so on. An
equivalent characterization of the OFDM MAC using SIC is given by

$$\mathcal{C}_{MAC}(\mathbf{h}, P) = \bigcup_{\pi \in \Pi}\ \bigcup_{\|\mathbf{p}\|_1 \le P} \left\{\mathbf{R} : R_{\pi(m)} = \tilde{R}_{\pi(m)},\ m \in \mathcal{M}\right\} \tag{7}$$

where $\tilde{R}_{\pi(m)}$ is given by

$$\tilde{R}_{\pi(m)} = \sum_{k=1}^{K} \log\left(1 + \frac{h_{\pi(m),k}\, p_{\pi(m),k}}{\sigma^2 + \sum_{n>m} h_{\pi(n),k}\, p_{\pi(n),k}}\right). \tag{8}$$

Indicating the relation to the broadcast channel, this characterization leads straight to the duality 
of both systems, treated in the next section. 
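
The SIC characterization of the MAC rates can be checked numerically. The following is a minimal sketch, assuming gains and powers are stored as arrays of shape (M, K) and the decoding order as a list of user indices with the first entry decoded first; the function name `mac_sic_rates` is hypothetical.

```python
import numpy as np

def mac_sic_rates(h, p, order, sigma2=1.0):
    """Per-user MAC rates in nats under SIC, as in eq. (8).

    order[0] is decoded first and sees the signals of all later users as
    interference; order[-1] is decoded last and sees only noise.
    """
    h, p = np.asarray(h, float), np.asarray(p, float)
    num_users, num_carriers = h.shape
    rates = np.zeros(num_users)
    interference = np.zeros(num_carriers)   # received power of users decoded later
    for user in reversed(order):            # start with the user decoded last
        rates[user] = np.sum(np.log1p(h[user] * p[user] / (sigma2 + interference)))
        interference += h[user] * p[user]
    return rates

rng = np.random.default_rng(0)
h = rng.exponential(size=(3, 4))
p = rng.uniform(size=(3, 4))
rates = mac_sic_rates(h, p, order=[2, 0, 1])
```

Whatever the decoding order, the individual rates change but their sum stays equal to $\sum_k \log(1 + \sigma^{-2} \sum_m h_{m,k} p_{m,k})$, the sum-rate bound in (6) for $S = \mathcal{M}$.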

B. Duality of OFDM BC and OFDM MAC 

A popular method to solve problems for the broadcast channel is to solve them for the MAC 
and to relate them to the BC by uplink-downlink duality. Such duality results between uplink 
and downlink were reported for quite a few scenarios in recent years [6], [13], [21], [22]. In this 
work, we exploit uplink-downlink duality as well. Hence, we state the OFDM duality result in 
the following. 



$^1$In a slight misuse of the term capacity, we denote the spectral efficiency [bits/s/Hz] as
capacity. Further, the expressions capacity and instantaneous capacity (denoting the capacity of
a single fading state of infinite length) are used interchangeably throughout this paper unless
specified otherwise.



Lemma 1: Let $\mathcal{C}_{BC}(\mathbf{h}, P)$ denote the capacity region of an OFDM BC under a
sum power constraint $P$ and channel realizations $\mathbf{h}$. In analogy, let
$\mathcal{C}_{MAC}(\mathbf{h}, P)$ denote the capacity region of the OFDM MAC with the same
parameters $P$ and $\mathbf{h}$. Then both regions coincide, i.e.

$$\mathcal{C}_{BC}(\mathbf{h}, P) = \mathcal{C}_{MAC}(\mathbf{h}, P). \tag{9}$$

Proof: It is easy to see that the OFDM BC can be written as a multiple input multiple
output (MIMO) BC simply by stacking the vectors of the signals
$\mathbf{x}_m = (x_{m,1}, \dots, x_{m,K})^T$, $\mathbf{y}_m = (y_{m,1}, \dots, y_{m,K})^T$,
$\mathbf{n}_m = (n_{m,1}, \dots, n_{m,K})^T$ and setting
$\mathbf{H}_m = \mathrm{diag}_{k=1}^{K}\{\tilde{h}_{m,k}\}$. The same holds for the OFDM MAC.

On the other hand, we know from the line of work [21]-[23] that duality holds for the
instantaneous capacity regions of the Gaussian MIMO BC
$\mathcal{C}_{BC}(\{\mathbf{H}_m\}_{m=1}^{M}, P)$ and the Gaussian MIMO MAC
$\mathcal{C}_{MAC}(\{\mathbf{H}_m^H\}_{m=1}^{M}, P)$. Combining the reformulation of the OFDM BC
(MAC) in terms of the MIMO BC (MAC) with the duality result concludes the proof. ■

Remark 1: Note, that the duality result holds independent of the number of transmit and 
receive antennas, since any MIMO-OFDM system can be written in block-diagonal form [24]. 
Only the conjugate transposition has to be taken into account in case of more than one antenna. 

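It is known from [13], [21] that both regions are achieved by circularly symmetric complex
i.i.d. Gaussian symbols $x_{m,k} \sim \mathcal{CN}(0, p_{m,k})$, and that if a set of rates
$\mathbf{R} \in \mathcal{C}_{BC}(\mathbf{h}, P)$ is achieved by a power allocation
$\mathbf{p}^{BC} = (p^{BC}_{1,1}, \dots, p^{BC}_{M,K})^T$, the same set of rates is achieved in
the dual MAC by the power allocation

$$\begin{aligned}
p^{MAC}_{\pi(M),k} &= p^{BC}_{\pi(M),k}\, \frac{\sigma^2}{\sigma^2 + h_{\pi(M),k} \sum_{n=1}^{M-1} p^{BC}_{\pi(n),k}} \\
p^{MAC}_{\pi(M-1),k} &= p^{BC}_{\pi(M-1),k}\, \frac{\sigma^2 + h_{\pi(M),k}\, p^{MAC}_{\pi(M),k}}{\sigma^2 + h_{\pi(M-1),k} \sum_{n=1}^{M-2} p^{BC}_{\pi(n),k}} \\
&\ \ \vdots \\
p^{MAC}_{\pi(1),k} &= p^{BC}_{\pi(1),k}\, \frac{\sigma^2 + \sum_{n=2}^{M} h_{\pi(n),k}\, p^{MAC}_{\pi(n),k}}{\sigma^2}
\end{aligned} \tag{10}$$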

Inverse relations - transforming the power allocations from MAC to BC - can be found in [13], 
Eq. (12). 
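
The recursion from BC powers to dual MAC powers can be sketched as follows. The sketch assumes the conventions stated above: the last entry of the order is encoded first in the BC and decoded last in the MAC, and each BC user sees the powers of the users encoded after it, scaled by its own gain. All function names are hypothetical.

```python
import numpy as np

def bc_rates(h, p, order, sigma2=1.0):
    """BC rates: order[-1] is encoded first and sees the powers of order[:-1]."""
    rates = np.zeros(h.shape[0])
    later_power = np.zeros(h.shape[1])       # sum of p over users encoded later
    for user in order:                       # order[0] is encoded last
        rates[user] = np.sum(np.log1p(h[user] * p[user] / (sigma2 + h[user] * later_power)))
        later_power += p[user]
    return rates

def mac_rates(h, p, order, sigma2=1.0):
    """MAC rates: order[0] is decoded first and sees all later users."""
    rates = np.zeros(h.shape[0])
    interference = np.zeros(h.shape[1])
    for user in reversed(order):
        rates[user] = np.sum(np.log1p(h[user] * p[user] / (sigma2 + interference)))
        interference += h[user] * p[user]
    return rates

def bc_to_mac(h, p_bc, order, sigma2=1.0):
    """Map BC powers to dual MAC powers, starting with the user at the last position."""
    order = np.asarray(order)
    p_mac = np.zeros_like(p_bc)
    prefix = np.cumsum(p_bc[order], axis=0) - p_bc[order]   # BC power of earlier positions
    interference = np.zeros(h.shape[1])                     # MAC power of later positions
    for pos in reversed(range(len(order))):
        user = order[pos]
        p_mac[user] = p_bc[user] * (sigma2 + interference) / (sigma2 + h[user] * prefix[pos])
        interference += h[user] * p_mac[user]
    return p_mac

rng = np.random.default_rng(0)
h = rng.exponential(size=(3, 4))
p_bc = rng.uniform(size=(3, 4))
p_mac = bc_to_mac(h, p_bc, [1, 2, 0])
```

In this sketch the transformation leaves every user's rate unchanged and preserves the sum power on each subcarrier, so the dual MAC meets the same sum power constraint.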

Note that, due to the established duality, it does not matter whether a problem is solved in
the OFDM MAC or the OFDM BC, since the solution can be transferred from one to the other.
Hence, we can choose the easier formulation among MAC and BC for each problem. In the
remainder of the paper, the subscripts $MAC$ and $BC$ are omitted and all variables refer to the
multiple access channel unless specified otherwise.

III. Problem Formulation and global Lagrangian perspective 

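In this paper, we want to solve the following three problems and relate them to each other:

Problem 1: [Weighted Sum Rates Problem]

$$\begin{aligned}
\text{maximize}\quad & \boldsymbol{\mu}^T \mathbf{R} \\
\text{subject to}\quad & \mathbf{R} \in \mathcal{C}(\mathbf{h}, P)
\end{aligned} \tag{11}$$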

This problem stems from an operator's perspective. The objective is to maximize a weighted
sum of rates subject to a sum power constraint $P$. The weights represent relative priorities.
The most common choice is the buffer-length weighted rate sum, which is related to the stability
region [25]. In principle, the weights $\boldsymbol{\mu}$ can have any other interpretation. The
approach yields a parameterization of the boundary of the capacity region.
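
Problem 2: [Minimum Sum Power Problem]

$$\begin{aligned}
\text{minimize}\quad & P \\
\text{subject to}\quad & R_i \ge \bar{R}_i \quad \forall i \in S,\ S \subseteq \mathcal{M} \\
& \mathbf{R} \in \mathcal{C}(\mathbf{h}, P)
\end{aligned} \tag{12}$$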

Minimizing the sum power which is necessary to achieve certain rate requirements
$\bar{\mathbf{R}} = (\bar{R}_1, \dots, \bar{R}_M)^T$ is the objective of the second problem.
Since the system is power limited but not interference limited, any set of finite rates
$\bar{\mathbf{R}}$ can be achieved. A crucial point is to determine the optimal decoding order.

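Problem 3: [Minimum Rates Problem]

$$\begin{aligned}
\text{maximize}\quad & \boldsymbol{\mu}^T \mathbf{R} \\
\text{subject to}\quad & R_i \ge \bar{R}_i \quad \forall i \in S,\ S \subseteq \mathcal{M} \\
& \mathbf{R} \in \mathcal{C}(\mathbf{h}, P)
\end{aligned} \tag{13}$$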

This problem is a combination of the two previous ones. It is more realistic and at the same time
more sophisticated. The first condition reflects the minimum rate requirements $\bar{\mathbf{R}}$
and the second stems from the limited power budget $P$. Once again, the optimal decoding order is
unknown. Additionally, the question of feasibility arises, since not every set of required rates
$\bar{\mathbf{R}}$ may be supportable with the given power budget $P$.

In the remainder of this paper, we will solve the three problems, each within a common
framework, and show the relation between them.

A. A global Lagrangian perspective 

We begin by deriving a unified framework for all problems in a higher-dimensional space. By
the concavity of the log function it is easy to prove the following lemma:
Lemma 2: The set $\mathcal{G}(\mathbf{h}) = \{(\mathbf{R}, P) : \mathbf{R} \in \mathcal{C}(\mathbf{h}, P)\}$ is a convex set.

Obviously, the duality of MAC and BC holds for the enhanced set $\mathcal{G}(\mathbf{h})$, too.
By Lemma 2 there are $\lambda \in \mathbb{R}_+$ and $\boldsymbol{\mu} \in \mathbb{R}_+^M$ such
that Problems 1-3 are equivalent to solving the following problem:

$$\max\ \boldsymbol{\mu}^T \mathbf{R} - \lambda P \quad \text{subj. to}\quad (\mathbf{R}, P) \in \mathcal{G}(\mathbf{h}) \tag{14}$$

Note that this reformulation is more than a formal step: each point on the boundary of the
set $\mathcal{G}(\mathbf{h})$ can be parameterized as the solution to the above problem for a
specific choice of the parameters $(\boldsymbol{\mu}, \lambda)$. Besides the Lagrangian
interpretation as the direction of steepest descent, the vector $(\boldsymbol{\mu}, \lambda)$ can
be interpreted as the normal vector of a tangent hyperplane at the solution. Further, the stated
problem is convex, since the function in (14) is affine and the set $\mathcal{G}(\mathbf{h})$ is
a convex set.

Each of the Problems 1-3 can be interpreted in the introduced framework: In Problem 1 the
Lagrangian vector $\boldsymbol{\mu}$ is known. The factor $\lambda$ is unknown a priori and can
be seen as the power price. In contrast, in Problem 2 the vector $\boldsymbol{\mu}$ constitutes
the unknown Lagrangian vector of the rate requirements. It turns out that the $\mu_m$ have the
additional interpretation as individual water-filling levels for rate water-filling. The factor
$\lambda = 1$ is fixed a priori; it could equivalently be fixed to any other positive real
number, which only results in a scaling of the other Lagrangian parameters. This is equivalent
to the normalization of the vector $\boldsymbol{\mu}$ in Problem 1, since the scaling of the
normal vector $(\boldsymbol{\mu}, \lambda)$ is irrelevant. Thus, the number of degrees of freedom
is one less than the number of Lagrangian multipliers.



The relation to the last Problem 3 will be pointed out in the remainder of this paper. All
problems have in common that we search for parts of the Lagrangian normal vector to the
supporting hyperplane and parts of the optimal supported point on the boundary of
$\mathcal{G}(\mathbf{h})$.



IV. Maximum weighted sum-rate for given power budget 

Although a well studied topic, we will derive an uplink solution to Problem 1 briefly for 
two reasons: First, the methods are of significant relevance for other systems such as multiuser 
MIMO, and further the derivation allows the characterization of orthogonal transmission modes, 
which are given in Lemma 3. 

The key to the reformulation as a convex problem is the polymatroid structure of the capacity
region $\mathcal{C}(\mathbf{h}, \mathbf{p})$ for a fixed power allocation $\mathbf{p}$. The
solution to the maximization of a linear function $g(\mathbf{R}) = \boldsymbol{\mu}^T \mathbf{R}$
over a polymatroid is known to be the vertex corresponding to the decreasing ordering $\pi$ of
the factors $\mu_m$ of $g(\mathbf{R})$ [26]. Taking the union over all power allocations in (7)
does not affect the decoding order, and hence the optimal decoding order is known to be $\pi$.
This leads to a convex formulation, whose Lagrangian is given by

$$\mathcal{L}(\mathbf{p}, \boldsymbol{\mu}, \lambda, \boldsymbol{\phi}) = -\sum_{m=1}^{M} \mu_{\pi(m)} \sum_{k=1}^{K} \log\left(1 + \frac{h_{\pi(m),k}\, p_{\pi(m),k}}{\sigma^2 + \sum_{n>m} p_{\pi(n),k}\, h_{\pi(n),k}}\right) + \lambda\left(\|\mathbf{p}\|_1 - P\right) - \sum_{m=1}^{M} \sum_{k=1}^{K} \phi_{m,k}\, p_{m,k} \tag{15}$$

where $\lambda$ is the Lagrangian factor belonging to the power constraint and $\phi_{m,k}$ is
the Lagrangian multiplier due to the nonnegativity of the power $p_{m,k}$. Note that
$\mathcal{L}(\mathbf{p}, \boldsymbol{\mu}, \lambda, \boldsymbol{\phi})$ is a convex function in
$\mathbf{p}$ since, due to the definition of the ordering $\pi$, we have
$\mu_{\pi(m)} \ge \mu_{\pi(m+1)}$. Hence, the Karush-Kuhn-Tucker (KKT) conditions are necessary
and sufficient for optimality and any standard convex optimization method can be applied to solve
the problem. By setting $c_m = \mu_{\pi(m)} - \mu_{\pi(m-1)}$ for all $m \ne 1$ and
$c_1 = \mu_{\pi(1)}$, the KKT conditions can be written as:




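$$h_{\pi(m),k} \sum_{j=1}^{m} \frac{c_j}{\sigma^2 + \sum_{n \ge j} p_{\pi(n),k}\, h_{\pi(n),k}} - \lambda + \phi_{\pi(m),k} = 0 \tag{16}$$

$$\sum_{m=1}^{M} \sum_{k=1}^{K} p_{m,k} - P \le 0 \tag{17}$$

$$\lambda \left(\sum_{m=1}^{M} \sum_{k=1}^{K} p_{m,k} - P\right) = 0 \tag{18}$$

$$\lambda \ge 0,\quad p_{m,k} \ge 0,\quad \phi_{m,k} \ge 0,\quad p_{m,k}\, \phi_{m,k} = 0 \tag{19}$$

for all $m$ and $k$. The KKT conditions will be used to derive Lemma 3.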

A. Algorithmic Solution 

An alternative algorithm, much more efficient than standard convex optimization, was presented
in the unpublished work [20]: Tse derived an algorithm for solving this problem for parallel
Gaussian channels directly in the downlink. Since the OFDM BC can be interpreted as a set of
parallel Gaussian channels (as long as the CP is long enough and orthogonality can be
guaranteed), the algorithm carries over to the OFDM BC case. He introduces the elegant notion of
marginal utility functions to characterize the contribution of each user to the objective
function:

$$u_{m,k}(z) = \frac{\mu_m h_{m,k}}{\sigma^2 + h_{m,k}\, z} - \lambda \tag{20}$$

The set of equations characterizing the solution is given by

$$R_m = \sum_{k=1}^{K} \int_0^{\infty} \frac{h_{m,k}}{\sigma^2 + h_{m,k}\, z}\ \mathbb{1}\left\{u_{m,k}(z) = \left[\max_{i} u_{i,k}(z)\right]^+\right\} dz \tag{21}$$

$$P(\lambda) = \sum_{k=1}^{K} \left[\max_{m} \left(\frac{\mu_m}{\lambda} - \frac{\sigma^2}{h_{m,k}}\right)\right]^+ \tag{22}$$

where $\lambda$ is the Lagrangian parameter of the sum power constraint and
$[a]^+ := \max(0, a)$. For a detailed study the reader is referred to [9], [20]. The solution can
be found by solving equation (22) for the Lagrangian multiplier $\lambda$. This can be done by
simple bisection, since the RHS of (22) is monotone in $\lambda$ and equation (22) yields a
simple expression for the sum power constraint $P$ as a function of the Lagrangian parameter
$\lambda$. Subsequently, the relevant intersections of the marginal utility functions have to be
calculated, which is not a combinatorial problem, since the ordering of the channel coefficients
defines the ordering of the intersections. Hence, Algorithm 1 presented below is more efficient
than standard convex optimization and than fixed-level water-filling, which will be introduced
shortly in Section V.

Algorithm 1 Weighted rate sum maximization
(1) solve (22) for Lagrangian factor $\lambda$

for $k = 1$ to $K$ do

    (2) order users according to their channel gains

    (3) determine intersections of marginal utility functions (20)
end for

(4) calculate resulting rates (21)
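
Step (1) of Algorithm 1 can be sketched as a bisection on the power price. The sketch assumes that the sum power as a function of $\lambda$ has the form $P(\lambda) = \sum_k [\max_m (\mu_m/\lambda - \sigma^2/h_{m,k})]^+$, which is decreasing in $\lambda$; steps (2)-(4) are not covered, and the function names are hypothetical.

```python
import numpy as np

def total_power(lam, h, mu, sigma2=1.0):
    """Sum power as a function of the power price lam (assumed form of eq. (22))."""
    levels = mu[:, None] / lam - sigma2 / h          # shape (M, K)
    return np.sum(np.maximum(levels.max(axis=0), 0.0))

def solve_power_price(h, mu, budget, sigma2=1.0, tol=1e-12):
    """Find lam with total_power(lam) = budget by bisection (budget > 0)."""
    hi = np.max(mu[:, None] * h) / sigma2            # no power is allocated at this price
    lo = hi
    while total_power(lo, h, mu, sigma2) < budget:   # lower the price until the budget is exceeded
        lo *= 0.5
    while hi - lo > tol * hi:
        mid = 0.5 * (lo + hi)
        if total_power(mid, h, mu, sigma2) > budget:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

rng = np.random.default_rng(0)
h = rng.exponential(size=(3, 8))
mu = np.array([1.0, 2.0, 0.5])
lam = solve_power_price(h, mu, budget=10.0)
```

Because the sum power is monotone in the price, the bracket shrinks by half in every iteration, so the cost of this step is a fixed number of evaluations of the sum over all carriers and users.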



B. Special transmission modes 

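It is desirable to know in which cases orthogonal signaling is optimal, since signal processing
is easy in this case. The arising problem is to find conditions for the optimality of FDMA and
for the single user range. The Karush-Kuhn-Tucker (KKT) conditions (16)-(19) from the convex
formulation in the uplink scenario allow such a statement:

Lemma 3: Assume that the user weights are ordered, i.e. $\mu_1 \ge \dots \ge \mu_M > 0$. Let the
index of the user transmitting exclusively on carrier $k$ be $m_k$. Then exclusive subcarrier
assignment (FDMA), i.e. allocating the power on carrier $k$ to user $m_k$ only for all
$k = 1, \dots, K$, is optimal if and only if

$$\max_{j} \frac{\mu_{m_j} h_{m_j,j}}{\sigma^2 + p_{m_j,j}\, h_{m_j,j}} \ge \max_{m \ne m_k} \left\{ \max(\mu_m - \mu_{m_k}, 0)\, \frac{h_{m,k}}{\sigma^2} + \min(\mu_m, \mu_{m_k})\, \frac{h_{m,k}}{\sigma^2 + p_{m_k,k}\, h_{m_k,k}} \right\} \quad \forall k. \tag{23}$$

Corollary 1: Single user transmission, i.e. allocating the power $P$ exclusively to user
$m_{SU}$, is optimal if and only if

$$\max_{j} \frac{\mu_{m_{SU}} h_{m_{SU},j}}{\sigma^2 + p_{m_{SU},j}\, h_{m_{SU},j}} \ge \max_{m \ne m_{SU}} \left\{ \max(\mu_m - \mu_{m_{SU}}, 0)\, \frac{h_{m,k}}{\sigma^2} + \min(\mu_m, \mu_{m_{SU}})\, \frac{h_{m,k}}{\sigma^2 + p_{m_{SU},k}\, h_{m_{SU},k}} \right\} \quad \forall k. \tag{24}$$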



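Proof: From the first KKT condition (16) it follows that $\phi_{m,k} \ge 0$ for users
$m \ne m_k$ and $\phi_{m_k,k} = 0$ on carrier $k$, since $p_{m,k} = 0$ for all $m \ne m_k$.
Hence, the KKT condition (16) for user $m_k$, i.e. the user transmitting exclusively on the
$k$th subcarrier, is given by

$$\lambda = \frac{\mu_{m_k} h_{m_k,k}}{\sigma^2 + p_{m_k,k}\, h_{m_k,k}}.$$

For users with index $m < m_k$ we can derive analogously

$$\lambda = \frac{\mu_m h_{m,k}}{\sigma^2 + p_{m_k,k}\, h_{m_k,k}} + \phi_{m,k}.$$

In case $m > m_k$, the KKT condition results in

$$\lambda = \frac{\mu_{m_k} h_{m,k}}{\sigma^2 + p_{m_k,k}\, h_{m_k,k}} + (\mu_m - \mu_{m_k})\, \frac{h_{m,k}}{\sigma^2} + \phi_{m,k}$$

since the interference term in the denominator vanishes for addends $m > m_k$. Thus, we get the
necessary and sufficient conditions

$$\lambda = \max_{j} \frac{\mu_{m_j} h_{m_j,j}}{\sigma^2 + p_{m_j,j}\, h_{m_j,j}} \ge \max_{m < m_k} \frac{\mu_m h_{m,k}}{\sigma^2 + p_{m_k,k}\, h_{m_k,k}} \tag{25}$$

$$\lambda = \max_{j} \frac{\mu_{m_j} h_{m_j,j}}{\sigma^2 + p_{m_j,j}\, h_{m_j,j}} \ge \max_{m > m_k} \left\{ \frac{\mu_{m_k} h_{m,k}}{\sigma^2 + p_{m_k,k}\, h_{m_k,k}} + \frac{(\mu_m - \mu_{m_k})\, h_{m,k}}{\sigma^2} \right\}. \tag{26}$$

Combining equations (25) and (26) leads to Lemma 3. The condition for the single user optimality
range follows immediately from setting $m_1 = \dots = m_K$. ■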

Remark 2: Note that in the case of sum rate, the condition in Lemma 3 is always fulfilled.
Hence, FDMA achieves the full sum rate in OFDM BC and OFDM MAC systems.

At first glance the condition in Lemma 3 seems complicated, since the powers themselves
occur in the expression. Nevertheless, a simple water-filling algorithm using the weighted
channel gains $\mu_m h_{m,k}$ as decision criterion yields the optimal allocation in case of FDMA
optimality, which is for example guaranteed in the low SNR regime. Once this is done, the
condition can be checked. Even if FDMA is not optimal, the resource allocation algorithm yields
excellent results as shown in [27].

V. Minimum sum power for given rate requirements 

In this section, we change perspectives and derive the solution to the minimum sum power
problem for given rate requirements $\bar{\mathbf{R}}$. The main difficulty is that the encoding
order is not clear, since the exploitation of the polymatroid structure in Section IV cannot be
transferred to the sum power minimization problem. In order to simplify the problem, we will
choose the uplink perspective once again, such that the unknown encoding order is equivalent to
the reverse SIC order.

A recent approach [7] proposed to divide the problem into two subproblems: first, finding the
optimal decoding order and second, solving the problem for a fixed decoding order. We show that
the split into two subproblems falls short of the structure of the problem, and that both
problems can be solved within one single framework.
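Problem 2 can be written as

$$\begin{aligned}
\min_{\pi \in \Pi}\quad & \|\mathbf{p}\|_1 \\
\text{subj. to}\quad & \sum_{k=1}^{K} \log\left(1 + \frac{p_{\pi(m),k}\, h_{\pi(m),k}}{\sigma^2 + \sum_{n>m} p_{\pi(n),k}\, h_{\pi(n),k}}\right) \ge \bar{R}_{\pi(m)} \\
& p_{m,k} \ge 0
\end{aligned} \tag{27}$$

where all powers $p_{m,k}$ are nonnegative. Unfortunately this formulation is not convex, since
the constraints are not convex in $\mathbf{p}$.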

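First, we reformulate the problem in terms of the rates $R_{m,k}$. To this end, solving

$$R_{\pi(m),k} = \log\left(1 + \frac{p_{\pi(m),k}\, h_{\pi(m),k}}{\sigma^2 + \sum_{n>m} p_{\pi(n),k}\, h_{\pi(n),k}}\right) \tag{28}$$

for $p_{\pi(m),k}$ yields after some manipulations

$$p_{\pi(m),k} = \frac{\sigma^2}{h_{\pi(m),k}} \left(\exp\left(R_{\pi(m),k}\right) - 1\right) \exp\left(\sum_{n>m} R_{\pi(n),k}\right). \tag{29}$$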

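Now with the substitution (29), the problem can be written as

$$\begin{aligned}
\min\quad & \sum_{m=1}^{M} \sum_{k=1}^{K} \frac{\sigma^2}{h_{\pi(m),k}} \left(\exp\left(R_{\pi(m),k}\right) - 1\right) \exp\left(\sum_{n>m} R_{\pi(n),k}\right) \\
\text{subj. to}\quad & \sum_{k=1}^{K} R_{m,k} \ge \bar{R}_m \\
& R_{m,k} \ge 0.
\end{aligned} \tag{30}$$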

Note that the constraints are now linear, but the objective function in (30) is no longer
convex. Fixing the decoding order $\pi$ a priori does not help to solve the problem, since the
objective function is not convex regardless of the decoding order. Surprisingly, this holds even
for the optimal decoding order.



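To proceed further and to circumvent this problem, we need the following lemma:
Lemma 4: Let $\mathcal{C}(\mathbf{h}, P)$ denote the capacity region of an OFDM MAC under a total
sum power constraint $P$. The OFDM MAC capacity region $\mathcal{C}(\mathbf{h}, P)$ can be
expressed as

$$\mathcal{C}(\mathbf{h}, P) = \bigcup_{\sum_{k=1}^{K} P^{(k)} = P}\ \sum_{k=1}^{K} \mathcal{C}^{\pi_k}\left(\mathbf{h}_k, P^{(k)}\right) \tag{31}$$

where $\mathbf{h}_k = (h_{1,k}, \dots, h_{M,k})^T$ and $\pi_k \in \Pi$ are decoding orders such
that

$$h_{\pi_k(1),k} \ge \dots \ge h_{\pi_k(M),k} \quad \forall k, \tag{32}$$

and

$$\mathcal{C}^{\pi_k}(\mathbf{h}_k, P) = \bigcup_{\sum_m p_m = P} \left\{\mathbf{R} : R_{\pi_k(m)} \le \log\left(1 + \frac{h_{\pi_k(m),k}\, p_{\pi_k(m)}}{\sigma^2 + \sum_{n>m} h_{\pi_k(n),k}\, p_{\pi_k(n)}}\right),\ \forall m \in \mathcal{M}\right\} \tag{33}$$

is the achievable rate region for a given decoding order $\pi_k$.
Proof: We know from [9], [20] that

$$\mathcal{C}(\mathbf{h}, P) = \bigcup_{\sum_{k=1}^{K} P^{(k)} = P}\ \sum_{k=1}^{K} \mathcal{C}\left(\mathbf{h}_k, P^{(k)}\right). \tag{34}$$

Further, the optimal decoding order for a scalar Gaussian MAC is given by $\pi_{opt}$ such that

$$h_{\pi_{opt}(1),k} \ge \dots \ge h_{\pi_{opt}(M),k} \quad \forall k. \tag{35}$$

With this decoding order, every point of the capacity region
$\mathcal{C}(\mathbf{h}_k, P^{(k)})$ of the degraded channel can be achieved; this is well known,
see e.g. [9]. Hence, in the special case of a scalar MAC (i.e. on each subcarrier), the capacity
region coincides with the region $\mathcal{C}^{\pi_{opt}}(\mathbf{h}_k, P^{(k)})$, i.e.

$$\mathcal{C}\left(\mathbf{h}_k, P^{(k)}\right) = \mathcal{C}^{\pi_{opt}}\left(\mathbf{h}_k, P^{(k)}\right). \tag{36}$$

Substituting (36) in (34) completes the proof. ■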
Lemma 4 states that the OFDM MAC (the same holds for the OFDM BC, see [20]) can
be decomposed into a union over the sum of scalar MACs, which are MAC channels where
the whole region can be achieved with a single decoding order. Each scalar MAC represents
a specific subcarrier. This representation will turn out to be the key to an efficient solution
to the sum power minimization problem, since it circumvents the problem of finding the globally
optimal decoding order. Motivated by Lemma 4, we introduce an individual decoding order on
each subcarrier denoted by $\pi_k$, such that

$$h_{\pi_k(1),k} \ge \dots \ge h_{\pi_k(M),k} \quad \forall k, \tag{37}$$

and define coefficients

$$c_{m,k} = \sigma^2 \left(h^{-1}_{\pi_k(m),k} - h^{-1}_{\pi_k(m-1),k}\right) \quad \forall m \in \mathcal{M} \setminus \{1\} \tag{38}$$

and

$$c_{1,k} = \sigma^2\, h^{-1}_{\pi_k(1),k} \tag{39}$$

which are non-negative due to the definition of the permutations $\pi_k$. Reordering the addends
and observing that the remaining term $-\sigma^2 h^{-1}_{\pi_k(M),k}$ does not alter the
solution, we arrive at

$$\begin{aligned}
\min\quad & \sum_{m=1}^{M} \sum_{k=1}^{K} c_{m,k} \exp\left(\sum_{n \ge m} R_{\pi_k(n),k}\right) \\
\text{subj. to}\quad & \sum_{k=1}^{K} R_{m,k} \ge \bar{R}_m \\
& R_{m,k} \ge 0.
\end{aligned} \tag{40}$$

Note that the objective function in (40) is log-convex, since it is a nonnegative sum of
log-convex functions. The constraints are linear and the function is defined on a convex set;
hence the problem can be solved by standard convex optimization methods.
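
The reordering of the addends that leads from (30) to (40) can be made explicit for a single subcarrier $k$. The following worked derivation is a sketch; the shorthands $a_m = \sigma^2 / h_{\pi_k(m),k}$ (with $a_0 = 0$) and $E_m = \exp(\sum_{n \ge m} R_{\pi_k(n),k})$ (with $E_{M+1} = 1$) are introduced here for illustration only and are not part of the original notation.

```latex
\begin{align*}
\sum_{m=1}^{M} a_m \left(e^{R_{\pi_k(m),k}} - 1\right) e^{\sum_{n>m} R_{\pi_k(n),k}}
  &= \sum_{m=1}^{M} a_m \left(E_m - E_{m+1}\right) \\
  &= \sum_{m=1}^{M} \left(a_m - a_{m-1}\right) E_m \;-\; a_M E_{M+1} \\
  &= \sum_{m=1}^{M} c_{m,k}\, E_m \;-\; \sigma^2 h^{-1}_{\pi_k(M),k}
\end{align*}
```

Since $a_m - a_{m-1} \ge 0$ whenever the gains are sorted in decreasing order, every coefficient in the last line is non-negative, which is what makes the objective a nonnegative sum of exponentials of linear functions.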
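Let $\mathcal{L}(\mathbf{R}, \boldsymbol{\mu}, \boldsymbol{\phi})$ be the Lagrangian given by

$$\mathcal{L}(\mathbf{R}, \boldsymbol{\mu}, \boldsymbol{\phi}) = \sum_{m=1}^{M} \sum_{k=1}^{K} c_{m,k} \exp\left(\sum_{n \ge m} R_{\pi_k(n),k}\right) - \sum_{m=1}^{M} \mu_m \left(\sum_{k=1}^{K} R_{m,k} - \bar{R}_m\right) - \sum_{m=1}^{M} \sum_{k=1}^{K} \phi_{m,k}\, R_{m,k} \tag{41}$$

where $\mu_m$ and $\phi_{m,k}$ are non-negative Lagrangian multipliers corresponding to the
individual rate constraints and the nonnegativity of the rates. The Karush-Kuhn-Tucker (KKT)
conditions are necessary and sufficient for the optimality of a given set of rates and are given
by

$$\sum_{s=1}^{m} c_{s,k} \exp\left(\sum_{n \ge s} R_{\pi_k(n),k}\right) - \mu_{\pi_k(m)} - \phi_{\pi_k(m),k} = 0 \tag{42}$$

$$\sum_{k=1}^{K} R_{m,k} - \bar{R}_m \ge 0 \tag{43}$$

$$\mu_m \left(\sum_{k=1}^{K} R_{m,k} - \bar{R}_m\right) = 0 \tag{44}$$

$$\mu_m \ge 0,\quad \phi_{m,k} \ge 0,\quad R_{m,k} \ge 0,\quad \phi_{m,k}\, R_{m,k} = 0 \tag{45}$$

for all $m$ and $k$. The KKT conditions offer the possibility to design an appealing algorithm
based on water-filling, which is computationally much less demanding than convex optimization.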



A. Optimal decoding order 

Up to now, the important question of the optimal decoding order has remained open. Once the
problem is solved and a solution to the set of equations (42)-(45) is found, i.e. the optimal
power allocation is found, the optimal decoding order can be identified:

One can simply go through the set of subcarriers and check the optimal power allocation on
each carrier. For each carrier, a nonempty set of optimal decoding orders exists. These are
given by all $\pi_k \in \Pi^k_{opt}$ such that $m < n$ implies

$$h_{\pi_k(m),k} \ge h_{\pi_k(n),k}, \quad m, n \in \mathcal{S}_k$$

where $\mathcal{S}_k = \{m : p_{m,k} > 0\}$.

In other words, $\Pi^k_{opt}$ is the set of decoding orders which do not change the order among
all transmitting users. The set of optimal decoding orders $\Pi_{opt}$ is the intersection of all
these sets

$$\Pi_{opt} = \bigcap_{k \in \mathcal{K}} \Pi^k_{opt}. \tag{46}$$

In most cases, the set $\Pi_{opt}$ reduces to one single decoding order $\pi_{opt}$. Since this
is a rather heuristic but intuitive explanation, we will make it more rigorous: the ordering of
the Lagrangian factors $\mu_m$ yields the optimal decoding order $\pi_{opt}$ such that

$$\mu_{\pi_{opt}(1)} \ge \dots \ge \mu_{\pi_{opt}(M)}. \tag{47}$$

This follows directly from the fact that the capacity region of the optimal power allocation is a
polymatroid. Now we have the following interpretation: for each point $\mathbf{R}$ on the
boundary of $\mathcal{C}(\mathbf{h}, P)$, there exists a global decoding order $\pi_{opt}$ such
that, applying this decoding order on every subcarrier, $\mathbf{R}$ is achieved. On the other
hand, Lemma 4 reveals that the capacity region is the union over individual MAC regions, whose
optimal decoding orders are known. Thus, there exists at least one set of individual decoding
orders $\{\pi_k\}_{k=1}^{K}$ such that, applying decoding order $\pi_k$ on subcarrier $k$, the
same point is achieved. Applying subcarrier-wise individual decoding orders does not increase the
achievable rate region due to Lemma 4. Hence, the entire capacity region can be achieved by a set
of decoding orders $\{\pi_k\}_{k=1}^{K}$ applied to each subcarrier individually. Note that the
assumption of individual decoding orders is made implicitly in [20] by introducing marginal
utility functions. The decoding order on subcarrier $k$ corresponds to the ordering of the sets
$\mathcal{A}_{n,k} = \{z \in [0, \infty) : u_{n,k}(z) = \max_{i \in \mathcal{M}} u_{i,k}(z)\}$.



B. Algorithmic solution 

To derive an efficient algorithm, we define coefficients $n_{m,k}$ as

$$n_{m,k} = \log\left\{ \exp\left(\sum_{n>m} R_{\pi_k(n),k}\right) \left[ \frac{\sigma^2}{h_{\pi_k(m),k}} + \sum_{j=1}^{m-1} \frac{\sigma^2}{h_{\pi_k(j),k}} \left(\exp\left(R_{\pi_k(j),k}\right) - 1\right) \exp\left(\sum_{n=j+1}^{m-1} R_{\pi_k(n),k}\right) \right] \right\} \tag{48}$$

which can be seen as effective noise levels. Then the Karush-Kuhn-Tucker (KKT) condition in
eq. (42) can be written in a form allowing water-filling:

$$R_{\pi_k(m),k} = \left[\log(\mu_{\pi_k(m)}) - n_{m,k}\right]^+ \quad \forall m, k. \tag{49}$$

Note that, in addition to the interpretation of $\mu_m$ as part of the normal vector of the
tangent hyperplane, the term $\log(\mu_m)$ is the water-filling level of the rate water-filling.

This motivates Algorithm 2, which shows excellent convergence behavior due to the
log-convexity and the semi-analytic stepwise solution through water-filling.

Algorithm 2 Iterative "multi-level rate water-filling"

set $R_{m,k} = 0 \quad \forall m \in \mathcal{M},\ k = 1, \dots, K$

while desired accuracy is not reached do
    for $m = 1$ to $M$ do

        (1) compute the coefficients $n_{m,k}$ (48) for user $m$

        (2) do water-filling with respect to the rates $R_{m,k}$ for user $m$ as in equation (49)
    end for

end while
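
Algorithm 2 can be sketched compactly. The sketch assumes per-carrier decoding orders sorted by decreasing gain, evaluates the effective noise level of a user as the logarithm of the bracketed sum of scaled exponentials, and finds each water level $\log(\mu_m)$ by bisection; a fixed number of sweeps stands in for the accuracy criterion, and all function names are hypothetical.

```python
import numpy as np

def sum_power(R, h, sigma2=1.0):
    """Sum power needed for the rates R (shape (M, K)), cf. eq. (30)."""
    total = 0.0
    for k in range(h.shape[1]):
        order = np.argsort(-h[:, k])                 # strongest user is decoded first
        r = R[order, k]
        later = np.append(np.cumsum(r[::-1])[::-1][1:], 0.0)
        total += np.sum(sigma2 / h[order, k] * np.expm1(r) * np.exp(later))
    return total

def noise_levels(R, h, user, sigma2=1.0):
    """Effective noise levels of one user on all carriers, cf. eq. (48)."""
    n = np.empty(h.shape[1])
    for k in range(h.shape[1]):
        order = np.argsort(-h[:, k])
        i = int(np.flatnonzero(order == user)[0])    # position of the user in the order
        r, a = R[order, k], sigma2 / h[order, k]
        inner = a[i]
        for j in range(i):                           # users decoded before this user
            inner += a[j] * np.expm1(r[j]) * np.exp(r[j + 1:i].sum())
        n[k] = r[i + 1:].sum() + np.log(inner)
    return n

def rate_waterfill(n, target, iters=200):
    """Rates [w - n_k]^+ whose sum meets the target; w is found by bisection."""
    lo, hi = n.min(), n.min() + target
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if np.maximum(mid - n, 0.0).sum() < target:
            lo = mid
        else:
            hi = mid
    return np.maximum(hi - n, 0.0)

def multilevel_rate_waterfilling(h, R_req, sigma2=1.0, sweeps=30):
    """Cycle over the users; each step is a rate water-filling for one user."""
    R = np.zeros(h.shape)
    history = []
    for _ in range(sweeps):
        for m in range(h.shape[0]):
            R[m] = rate_waterfill(noise_levels(R, h, m, sigma2), R_req[m])
        history.append(sum_power(R, h, sigma2))
    return R, history
```

For fixed rates of the other users, the sum power depends on the rates of user $m$ only through $\sum_k \exp(R_{m,k} + n_{m,k})$, so each water-filling step solves its partial problem exactly and the recorded sum power cannot increase from one sweep to the next.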



Theorem 1: Let $\mathbf{R} = (R_{1,1}, \dots, R_{M,K})^T$ be a vector of rates such that
equations (42)-(45) are fulfilled. Algorithm 2 converges to $\mathbf{R}$.
Proof: We define the set of rates

$$\mathcal{R} = \left\{R_{m,k} : R_{m,k} \le \max_{m} \bar{R}_m,\ m \in \mathcal{M}\right\}. \tag{50}$$

Since all rate requirements are finite by definition, $\mathcal{R}$ is a compact convex set. The
sum power as a function of the actual rates, $P(\mathbf{R})$ with $\mathbf{R} \in \mathcal{R}$,
is continuously differentiable on $\mathcal{R}$ and bounded from below. The constraints are
separable. In each step $n$ of Algorithm 2, the unique water-filling solution with respect to one
specific user $m$ is found by fulfilling the individual optimality conditions of this partial
problem. So we know that

$$P(\mathbf{R})^{(n+1)} \le P(\mathbf{R})^{(n)} \tag{51}$$

where $P(\mathbf{R})^{(n)}$ is the sequence of sum powers. Hence, $P(\mathbf{R})^{(n)}$ is
monotonically decreasing and bounded from below. Taking subsequences yields a convergent
sequence, which has a limit point $\mathbf{R}$. It is shown in [28] that, due to the uniqueness
of the solution to each partial step, any limit point is a stationary point. By the convexity of
the problem, each local optimum is a global optimum, and hence we conclude that $P(\mathbf{R})$
is the minimum sum power. Furthermore, if all channel gains on each subcarrier are distinct,
which occurs with probability 1, the sequence of rates also converges to a unique solution. ■
The convergence behavior is illustrated in Figure 1. The decoding orders $2 \to 3 \to 4 \to 1$
and $2 \to 3 \to 1 \to 4$ turn out to be optimal: the users 1 and 4 do not interfere on any subcarrier.
It can be seen clearly that the algorithm from [7] shows an unsteady convergence behavior
due to nonconvexity.
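The monotonicity argument used in the proof of Theorem 1 (exact minimization over one block of variables at a time, giving a non-increasing objective as in (51)) can be illustrated on a generic problem. The following is a minimal sketch only: the strictly convex quadratic objective, the function name `block_coordinate_descent`, and the block structure are assumptions chosen for illustration. They stand in for the sum power $P(\mathbf{R})$ and the per-user water-filling step, which are not reproduced here.

```python
import numpy as np


def block_coordinate_descent(Q, b, blocks, sweeps=50):
    """Cyclically minimise f(x) = 0.5 x^T Q x - b^T x over one block at a time.

    Q is assumed symmetric positive definite, so every partial problem has a
    unique solution (the property the convergence proof relies on).
    Returns the final iterate and the objective value after every partial step.
    """
    Q = np.asarray(Q, dtype=float)
    b = np.asarray(b, dtype=float)
    x = np.zeros(b.size)

    def objective(v):
        return 0.5 * v @ Q @ v - b @ v

    history = [objective(x)]
    all_idx = np.arange(b.size)
    for _ in range(sweeps):
        for idx in blocks:  # one "user" per block
            idx = np.asarray(idx)
            rest = np.setdiff1d(all_idx, idx)
            # Exact minimiser over the block with all other variables held fixed.
            rhs = b[idx] - Q[np.ix_(idx, rest)] @ x[rest]
            x[idx] = np.linalg.solve(Q[np.ix_(idx, idx)], rhs)
            history.append(objective(x))
    return x, history
```

Because each partial step is an exact minimization, the recorded objective values never increase, and for this strictly convex toy problem the iterates approach the unique global minimizer.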

VI. Minimum rates and given power budget 

With the insights of the preceding sections, we turn to Problem 3. Although the problem is 
very similar to the case studied in [19], the OFDM case is more complicated, since there is 
no analytical expression for the minimum sum power needed to fulfill the rate requirements. 
Although not obvious at first glance, Problem 3 can also be interpreted in terms of (14) and 
thus is a convex problem, which will be shown in this section. Additionally - and indicating 
the difference to the two previously considered problems - the question of feasibility arises. Not 
all rates might be supportable with a limited power budget P. Hence, we will have to check 
feasibility. Another difficulty is that the decoding order is not obvious. Although the weights $\mu$
are known, they do not determine the optimal decoding order, since they constitute only a part
of the tangent hyperplane normal vector.

A. Convexity and Feasibility 

First, we take a look at the questions of convexity and feasibility. To this end, we denote
the feasible set of Problem 3 by

$\mathcal{R}_f(\mathbf{h}, P) = \{\mathbf{R} : \mathbf{R} \in \mathcal{C}(\mathbf{h}, P),\; R_i \ge \bar{R}_i \;\; \forall i \in \mathcal{S}\}$ (52)



where S is the set of users with rate constraints. 

Lemma 5: The feasible set $\mathcal{R}_f(\mathbf{h}, P)$ of Problem 3 in (52) is a convex set. Further, Problem
3 in (11) is feasible if and only if

$P \ge P_{\min}$

where $P$ is the power budget and $P_{\min}$ is the solution to Problem 2 with the same given rate
requirements $\bar{\mathbf{R}}$ as in Problem 3.

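Proof: The set $\mathcal{C}(\mathbf{h}, P)$ is a convex set by definition for all $\mathbf{h}$ and $P$, since we can perform
time sharing. The sets $\mathcal{H}_i = \{\mathbf{R} : R_i \ge \bar{R}_i\}$, $i \in \mathcal{S}$, are half-spaces bounded by a hyperplane
and hence convex sets. Then

$\mathcal{R}_f(\mathbf{h}, P) = \mathcal{C}(\mathbf{h}, P) \cap \bigcap_{i \in \mathcal{S}} \mathcal{H}_i$

is the intersection of convex sets and thus a convex set.

A necessary and sufficient condition for feasibility is the nonemptiness of the feasible set.
Assume a nonempty feasible set, i.e. $\bar{\mathbf{R}} \in \mathcal{C}(\mathbf{h}, P)$, and $P < P_{\min}$. By definition, the rate vector
$\bar{\mathbf{R}}$ lies on the boundary of $\mathcal{C}(\mathbf{h}, P_{\min})$. Since $P < P_{\min}$, the region $\mathcal{C}(\mathbf{h}, P)$ is strictly smaller
than $\mathcal{C}(\mathbf{h}, P_{\min})$. This leads to a contradiction, since $\bar{\mathbf{R}} \in \mathcal{C}(\mathbf{h}, P)$ and $\bar{\mathbf{R}}$ lies on the boundary
of a strictly larger region. Conversely, if $P \ge P_{\min}$, then $\bar{\mathbf{R}} \in \mathcal{C}(\mathbf{h}, P_{\min}) \subseteq \mathcal{C}(\mathbf{h}, P)$, so the
feasible set is nonempty. ■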
This result provides a simple method to answer the question of feasibility. One just has to solve 
the sum power minimization problem in (13) with Algorithm 2 for the required rates R. 
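For intuition, the feasibility test of Lemma 5 can be sketched for the special case of a single user, where the minimum power follows from classical water-filling. This is not Algorithm 2. The function names `min_power_single_user` and `is_feasible`, the unit noise power, and rates measured with base-2 logarithms are assumptions made for this illustration only.

```python
import numpy as np


def min_power_single_user(h, rate_target, tol=1e-10):
    """Minimum sum power to reach rate_target (> 0) over parallel channels.

    Assumes unit noise power and channel power gains h[k] > 0. With water
    level nu, the allocation is p[k] = max(0, nu - 1/h[k]) and the rate is
    sum_k log2(max(1, nu * h[k])), which is non-decreasing in nu.
    """
    h = np.asarray(h, dtype=float)

    def rate(level):
        return np.sum(np.log2(np.maximum(1.0, level * h)))

    lo, hi = 0.0, 1.0 / h.max()  # below 1/max(h) no channel is active
    while rate(hi) < rate_target:  # grow the bracket until the target is met
        hi *= 2.0
    while hi - lo > tol * hi:  # bisection on the water level
        mid = 0.5 * (lo + hi)
        if rate(mid) < rate_target:
            lo = mid
        else:
            hi = mid
    p = np.maximum(0.0, hi - 1.0 / h)
    return p.sum(), p


def is_feasible(power_budget, p_min):
    """Feasibility test in the spirit of Lemma 5: budget at least P_min."""
    return power_budget >= p_min
```

In the multiuser case the quantity `p_min` would instead come from the sum power minimization, while the comparison against the budget stays the same.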

The Lagrangian of the problem is given by: 




(53) 
(54)
where we substituted the rates:




B. Algorithmic solution 

In this section, we derive two algorithms for Problem 3. Each is derived from one of the two 
concurring perspectives. The basic idea of Algorithm 3 is to iteratively increase the user weights 
such that rate requirements are fulfilled in each step: 

Algorithm 3 Minimum Rates Algorithm

(1) check feasibility with Algorithm 2
(2) initialize $\mu^* = \mu$
while desired accuracy not reached do
    for m = 1 to M do
        if $R_m < \bar{R}_m$ then
            (3) increase $\mu_m^*$ such that $R_m = \bar{R}_m$ using Algorithm 1
        end if
    end for
end while



Theorem 2: Algorithm 3 converges to a stationary point R*, which is the global optimum of 
Problem 3. 

The proof is based on the following lemma, which can be verified considering the expression
for the rate $R_m$ (21). It is worth noting that the situation is different from that in [20], since the
Lagrangian factor $\lambda$ has to be adjusted due to the fixed sum power budget.

Lemma 6: For all $m$ and a fixed sum power $P$, if the $m$th component $\mu_m$ of the Lagrangian
vector is increased and the other components are held fixed, the rate $R_m(\mu)$ remains the same
or increases while $R_n(\mu)$ decreases for $n \neq m$.

Proof: The rates $R_{n,k}$ (21) depend only on the size of the corresponding sets

$\mathcal{A}_{n,k} = \{z \in [0, \infty) : u_{n,k}(z) = \max_{j} u_{j,k}(z)\}$. (55)

Obviously, an increase of $\mu_m$ does not influence the sets or causes them to shrink for all $n \neq m$
and for all $k$. It remains to show that an increase of $\mu_m$ cannot reduce the set size of the
same user $m$ on any subcarrier. Distinguish two cases: if $m \neq \arg\max_{n \in \mathcal{M}} z_0(n) : u_{n,k}(z_0) = 0$,
i.e. the marginal utility function of user $m$ does not constitute the last intersection on any
subcarrier $k$, the parameter $\lambda$ remains unchanged and hence the sets $\mathcal{A}_{m,k}$ cannot shrink. If
$m = \arg\max_{n \in \mathcal{M}} z_0(n) : u_{n,k}(z_0) = 0$ on at least one subcarrier $k$, then $\lambda$ increases to meet the
sum power constraint. This reduces the maximum intersection $z_0(n, k)$ on all other subcarriers.
On the other hand, the intersection of the marginal utility function $u_{m,k}$ of user $m$ is shifted by
any $\Delta\mu_m$ and $\Delta\lambda$ by the same offset

$\Delta z_0 = \frac{\lambda\,\Delta\mu_m - \mu_m\,\Delta\lambda}{(\Delta\lambda + \lambda)\,\lambda}$ (56)

independent of the carrier $k$. The important observation is that this shift is thus equal for all
carriers and nonnegative, since the power constraint has to be met and all other carriers experience
a power decrease. Hence, the rate of user $m$ cannot decrease. ■
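The carrier-independence of the shift can be checked with a short calculation. The derivation below is a sketch under the assumption that the marginal utility function has the form $u_{m,k}(z) = \mu_m / (1/h_{m,k} + z) - \lambda$, with $h_{m,k}$ denoting the channel gain of user $m$ on subcarrier $k$. This form and the symbol $h_{m,k}$ are assumptions made here for illustration, since the defining expression (21) is not repeated in this section.

```latex
\begin{align*}
u_{m,k}(z_0) = 0
  \;&\Longrightarrow\;
  z_0(m,k) = \frac{\mu_m}{\lambda} - \frac{1}{h_{m,k}}, \\
\Delta z_0
  &= \frac{\mu_m + \Delta\mu_m}{\lambda + \Delta\lambda} - \frac{\mu_m}{\lambda}
   = \frac{\lambda\,\Delta\mu_m - \mu_m\,\Delta\lambda}{(\Delta\lambda + \lambda)\,\lambda}.
\end{align*}
```

Under this assumption the channel term $1/h_{m,k}$ cancels in the difference, which is why the offset does not depend on the subcarrier index $k$.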
Now, we can prove Theorem 2:

Proof: [Theorem 2] Assume that Problem 3 is feasible. Hence, there exist $\lambda \in \mathbb{R}_+$ and
$\hat{\mu} \in \mathbb{R}_+^M$ such that the KKT conditions are fulfilled. Let $\mu^* = \mu + \hat{\mu}$. From (21) it can be seen
that $R_m(\mu_m^*)$ is a monotone function in $\mu_m^*$ for any fixed sum power $P$, where the power price
$\lambda$ is chosen such that the sum power constraint is met. Further,

$\lim_{\mu_m^* \to \infty} R_m(\mu_m^*) = R_m^{\mathrm{su}}$ (57)

where $R_m^{\mathrm{su}}$ is the single-user water-filling solution for user $m$, and $R_m^{\mathrm{su}} \ge \bar{R}_m$ since the problem
is feasible by assumption. Then in iteration $n + 1$ one can find a $\mu_m^{*(n+1)}$ such that $R_m^{(n+1)} = \bar{R}_m$ in
case that $R_m^{(n)} < \bar{R}_m$. Using Lemma 6, $\mu^{*(n)}$ is a component-wise monotone sequence. Define a
mapping $U$ representing the update of the sequence $\mu^{*(n)}$. Lemma 6 guarantees that $U$ is order
preserving. Starting with $\mu^{*(0)} = \mu$, we know that $\mu^{*(n)} \le \mu^*_{\mathrm{opt}}$ for any $n$, where $\mu^*_{\mathrm{opt}} \ge \mu$
is the solution such that the KKT conditions are fulfilled and $a \ge b$ refers to component-wise
greater or equal. Thus, $\mu^*_{\mathrm{opt}}$ is a fixed point of the mapping $U$. Hence, $\{\mu^{*(n)}\}$ is a component-wise
monotone sequence bounded from above by $\mu^*_{\mathrm{opt}}$ and converges to the limiting fixed point
$\mu^*_{\mathrm{opt}}$, at which the KKT conditions are fulfilled and the associated rate vector $\mathbf{R}^*(\mu^*_{\mathrm{opt}})$ is
achieved. ■
Figures 3 and 4 illustrate the convergence process for an exemplary random channel with
K = 256 subcarriers and M = 4 users. Note that the algorithm has the same disadvantage as
the sum power minimization algorithm based on the BC perspective described in Section IV:
for any finite $n$, the resulting point $\mathbf{R}^{(n)}$ is infeasible. Each iteration consists of a loop over
all users. Nevertheless, the algorithm offers a considerable speed-up compared to the alternative
Algorithm 4 based on multi-level rate water-filling presented below. The latter algorithm is based on
eq. (14). Depending on whether the rate constraints are active or not, the water-filling level is
adjusted such that the requirements are met.

Algorithm 4 Water-filling based Minimum Rates Algorithm

(1) check feasibility with Algorithm 2
(2) choose initial Lagrangian factors $\lambda_+$ and $\lambda_-$
while sum power constraint $P$ is not met do
    while desired accuracy not reached do
        for m = 1 to M do
            (3) compute the coefficients $n_{m,k}$ (48) for user m
            (4) do water-filling with fixed level $\mu_m$ (49)
            if $R_m < \bar{R}_m$ then
                (5) choose water-filling level $\log(\mu_m + \hat{\mu}_m)$ such that $R_m = \bar{R}_m$
            end if
        end for
    end while
    (6) increase (decrease) $\lambda$ by bisection if the resulting sum power is larger (smaller) than the budget $P$
end while



We have the following theorem ensuring convergence: 

Theorem 3: Algorithm 4 converges to a stationary point R*, which is the global optimum of 
Problem 3. 

Proof: First we show that, in case of feasibility, the inner loop converges for any fixed $\lambda$:
for any fixed $\lambda$, the set of rates fulfilling the rate constraints, $\mathcal{R}$ as defined in (50), is a compact
convex set. The objective function $g(\mathbf{R}) = \mu^T \mathbf{R}$ is continuously differentiable on $\mathcal{R}$. In each
partial step, the unique solution to the convex optimization problem of one specific user $m$ is
found. Thus, the same argument as in Theorem 1 holds, ensuring convergence to the global
optimum $\mathbf{R}(\lambda)$ due to convexity of the problem.



For the outer loop to converge, it remains to show that $P(\lambda)$ is monotone in $\lambda$. This can be
seen as follows: for each $\lambda$ we have to solve

$f(\lambda) = \max_{P} \{\mu^T \mathbf{R}(P) - \lambda P\}$ (58)

where $\mathbf{R}(P)$ is the solution to Problem 3 for power budget $P$. Eqn. (58) can be lower bounded
by

$f(\lambda) \ge \max_{P} \{c_0 R_0(P) - \lambda P\} = f_1(\lambda)$

where $R_0(P)$ is any single-user solution and $c_0$ is a constant depending on the weights. Since
$f_1(\lambda) \to \infty$ for $\lambda \to 0$, it follows that $f(\lambda) \to \infty$ for $\lambda \to 0$, where it is clear that this can only
happen if $P \to \infty$. Thus $P \to \infty$ as $\lambda \to 0$, and since the rate and power allocation is unique,
$\lambda$ is unique for given $P$ and a smooth function; hence, the claim follows.

Consequently, since the inner loop converges for any fixed value of $\lambda$ and the sum power $P$ is
monotone in $\lambda$, so that the appropriate $\lambda$ can be found by simple bisection, Algorithm 4 converges
to a stationary point $\mathbf{R}^*$, which is the global optimum. ■
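The outer loop of Algorithm 4 relies only on the monotonicity of the sum power in the power price $\lambda$. The following minimal sketch shows this bisection for a single user in the power domain, where the allocation for a fixed price is available in closed form. It is an illustration rather than the rate-domain algorithm above: the allocation rule $p_k = \max(0, \mu/\lambda - 1/h_k)$ (natural logarithm, unit noise power) and the names `sum_power` and `bisect_power_price` are assumptions introduced here.

```python
import numpy as np


def sum_power(lam, mu, h):
    """Sum power of the allocation p[k] = max(0, mu/lam - 1/h[k]).

    This is the maximiser of mu * sum_k ln(1 + h[k] p[k]) - lam * sum_k p[k]
    and is non-increasing in the price lam.
    """
    h = np.asarray(h, dtype=float)
    return np.sum(np.maximum(0.0, mu / lam - 1.0 / h))


def bisect_power_price(mu, h, budget, tol=1e-10):
    """Find the price lam whose allocation uses exactly `budget` (> 0)."""
    h = np.asarray(h, dtype=float)
    lam_hi = mu * h.max()  # at this price no channel is active (zero power)
    lam_lo = lam_hi
    while sum_power(lam_lo, mu, h) < budget:  # lower the price until budget is exceeded
        lam_lo *= 0.5
    while lam_hi - lam_lo > tol * lam_hi:
        lam = 0.5 * (lam_lo + lam_hi)
        if sum_power(lam, mu, h) > budget:
            lam_lo = lam  # too much power is used: increase the price
        else:
            lam_hi = lam  # too little power is used: decrease the price
    return 0.5 * (lam_lo + lam_hi)
```

In the multiuser setting, each evaluation of `sum_power` would be replaced by a full run of the inner iterative water-filling loop for the current $\lambda$.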

C. Lagrangian interpretation and optimal decoding order 

Define the vector $\mu^* = (\mu_1 + \hat{\mu}_1, \dots, \mu_M + \hat{\mu}_M)^T$. The components $\mu_m$ are the weight factors
from (11) and the components $\hat{\mu}_m$ are the Lagrangian multipliers of the rate constraints in (11).
Further, $\lambda$ is the Lagrangian multiplier due to the power constraint $P$. Then the vector $(\mu^*, \lambda)^T$
is the normal vector of the supporting hyperplane to the enhanced set introduced for (14), and the supported point
is the solution $(\mathbf{R}^*, P)$ to (11). The Lagrangian factor $\hat{\mu}_m$ is strictly greater than zero if and
only if the rate constraint of user $m$ is active. Then $\hat{\mu}_m$ contributes to the corresponding
component of the normal vector of the supporting hyperplane, assuring the minimum rate. This becomes apparent
in Figures 5 and 6, where at approximately 12 dB the constraint of the first user with $\mu_1 = 0.35$ becomes
active. As the SNR approaches the minimum sum power, all existing constraints $\bar{\mathbf{R}}$ become
active and the Lagrangian multipliers grow without bound.

To derive the optimal decoding order, we focus on the Lagrangian interpretation: we know 
that the vector R* is the solution to the optimization problem: 

$\max \; \mu^{*T} \mathbf{R}$ subj. to $\mathbf{R} \in \mathcal{C}(\mathbf{h}, P)$ (59)

The point is achieved with a decoding order $\pi^*$ such that

$\mu^*_{\pi^*(M)} \ge \mu^*_{\pi^*(M-1)} \ge \dots \ge \mu^*_{\pi^*(1)}$ (60)

and the solution $\mathbf{R}^*$ is the vertex $\mathbf{R}^{\pi^*}$ of the corresponding polymatroid. Note that the optimal
decoding order cannot be determined a priori, since the required weight vector $\mu^*$, which includes
the Lagrangian factors $\hat{\mu}$ corresponding to the rate requirements, is not known. The weights
$\mu$ constitute only a part of the overall normal vector. Hence, the optimal decoding order is part
of the solution. This interesting fact is in analogy to [1], and is illustrated in Figure 5. Note
that even for an example with $M = 4$ users there are five different optimal decoding orders
depending on the SNR.
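Once the combined vector $\mu^*$ is known, reading off a decoding order that satisfies (60) amounts to sorting. The sketch below illustrates this. The function name `decoding_order` is hypothetical, and the multiplier values used in the example are invented for illustration and are not taken from the figures; only the weights $(0.35, 0.4, 0.1, 0.15)$ come from the caption of Fig. 3. Ties are broken by user index, which is an arbitrary choice.

```python
def decoding_order(mu, mu_hat):
    """Return (pi, mu_star) with pi[0] = pi*(1), ..., pi[-1] = pi*(M).

    mu_star[m] = mu[m] + mu_hat[m]; users are listed by ascending combined
    weight so that mu_star[pi*(M)] >= ... >= mu_star[pi*(1)] as in (60).
    Users are numbered from 1.
    """
    mu_star = [a + b for a, b in zip(mu, mu_hat)]
    ascending = sorted(range(len(mu_star)), key=lambda m: mu_star[m])
    return [m + 1 for m in ascending], mu_star


# Weights from the caption of Fig. 3; the multipliers are made-up values.
mu = [0.35, 0.4, 0.1, 0.15]
order_without_constraints, _ = decoding_order(mu, [0.0, 0.0, 0.0, 0.0])
order_with_active_constraint, _ = decoding_order(mu, [0.0, 0.0, 0.5, 0.0])
```

The two calls give different orders, which mirrors the observation that the weights $\mu$ alone do not fix the decoding order once a rate constraint becomes active.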

VII. A unifying framework

It is apparent that all problems can be solved by algorithms based on iterative multi-level
rate water-filling, avoiding convex optimization: In Problem 1, the water-filling levels are known
in advance and we perform multi-level rate water-filling with fixed levels. In Problem 2, the
rate requirements lead to classical multi-level water-filling. Finally, Problem 3 is solved by
combining these two techniques: fixed-level water-filling with an additional optional step of
traditional water-filling that ensures the minimum rate (so that the feasible set is not left). This
interesting fact is due to the joint problem formulation in eq. (14) on the enhanced set. Its
KKT conditions characterize the solution to all problems.

A slight disadvantage of the algorithms for Problems 1 and 3 is that the adequate Lagrangian
factor reflecting the sum power constraint is not known: hence, for each step of bisection, i.e.
each choice of $\lambda$, the iterative water-filling algorithm has to be run. This is due to the fact that
the reformulation in terms of the rates does not allow an explicit incorporation of the sum power
constraint. The minimum sum power algorithm is not affected by this fact, since the Lagrangian
factor $\lambda$ is fixed a priori.

On the other hand, the downlink formulation from [20] delivers a different class of algorithms:
they exploit the marginal utility or rate splitting formulation. The resulting algorithms for
Problems 2 and 3 have the slight disadvantage that, for any finite number of steps, the corresponding
rate/power allocation does not meet the constraints. Further, the adjustment of the Lagrangian
vector $\mu$ requires the calculation of the intersections of the marginal utility functions on all
subcarriers in each step.



VIII. Conclusion 

In this paper, we considered two fundamental resource allocation problems for OFDM
multiuser systems: the sum power minimization problem for required rates and the maximization of
a weighted rate-sum for a given power budget. In principle, the OFDM BC is a non-degraded
channel. We derived the solution to the minimum sum power problem by expressing the dual
OFDM MAC channel as a union over the sum of degraded channels. This allows the
problem to be formulated as a convex optimization problem. We showed the connection between these two problems by
interpreting both problems as special cases of an optimization problem in a unifying Lagrangian
framework. To this end, we introduced the enhanced set. This framework allows the
design of algorithms based on rate water-filling for both problems. In principle, all problems can
alternatively be solved in the BC directly using the notion of marginal utility functions stemming
from [20]. With these insights, we addressed the mixed problem of maximizing a weighted sum
of rates under given rate requirements and for a given power budget. Once again, we derived
an algorithm from each perspective and showed that the sum of weights $\mu$ and Lagrangian
multipliers $\hat{\mu}$, extended by the power price $\lambda$, constitutes the normal vector $(\mu + \hat{\mu}, \lambda)^T$ to a
tangent hyperplane of the enhanced set at the solution. Moreover, the ordering of the Lagrangian multipliers
reveals the optimal encoding order for dirty paper coding.

Fig. 1. Comparison of Algorithm 2 with the algorithm for fixed decoding order $\pi_s$ presented in [7]. Random channel with
K = 128 subcarriers, M = 4 users and $\bar{\mathbf{R}} = (2.5 \; 0.4 \; 0.8 \; 2)$ [bps/Hz].

Fig. 2. Feasible set of a random channel with 256 subcarriers and rate requirements $R_1 \ge 2$ and $R_2 \ge 1$ at 10 dB. The vectors
$\mu$ and $\mu^*$ illustrate the initial weight vector and the resulting Lagrangian vector $\mu^* = \mu + \hat{\mu}$.

Fig. 3. Convergence of the Lagrangian vector $\mu^*$ for a system with M = 4 users, K = 256 at 10 dB with $\mu = (0.35 \; 0.4 \; 0.1 \; 0.15)^T$
and required rates $\bar{\mathbf{R}} = (1 \; 1.25 \; 0.5)^T$.




Fig. 4. Convergence of R for the same system as in Figure 3. 




Fig. 5. $\mu^* = (\mu + \hat{\mu})$ for different SNR and the same settings as in Figs. 3 and 4. The four SNR values where the optimal
decoding order changes are marked with dash-dotted lines.




